Higher classification sensitivity of short metagenomic reads with CLARK-S Supplementary Material
نویسندگان
چکیده
منابع مشابه
Higher classification sensitivity of short metagenomic reads with CLARK-S
The growing number of metagenomic studies in medicine and environmental sciences is creating increasing demands on the computational infrastructure designed to analyze these very large datasets. Often, the construction of ultra-fast and precise taxonomic classifiers can compromise on their sensitivity (i.e. the number of reads correctly classified). Here we introduce CLARK-S, a new software too...
متن کاملHigher Classification Accuracy of Short Metagenomic Reads by Discriminative Spaced k-mers
The growing number of metagenomic studies in medicine and environmental sciences is creating new computational demands in the analysis of these very large datasets. We have recently proposed a timeefficient algorithm called Clark that can accurately classify metagenomic sequences against a set of reference genomes. The competitive advantage of Clark depends on the use of discriminative contiguo...
متن کاملdeSPI: efficient classification of metagenomic reads with lightweight de Bruijn graph-based reference indexing
Summary: In metagenomic studies, fast and effective tools are on wide demand to implement taxonomy classification for upto billions of reads. Herein, we propose deSPI, a novel read classification method that classifies reads by recognizing and analyzing the matches between reads and reference with de Bruijn graph-based lightweight reference indexing. deSPI has faster speed with relatively small...
متن کاملA novel data structure to support ultra-fast taxonomic classification of metagenomic sequences with k-mer signatures
Motivation Metagenomic read classification is a critical step in the identification and quantification of microbial species sampled by high-throughput sequencing. Although many algorithms have been developed to date, they suffer significant memory and/or computational costs. Due to the growing popularity of metagenomic data in both basic science and clinical applications, as well as the increas...
متن کاملCSSSCL: a python package that uses combined sequence similarity scores for accurate taxonomic classification of long and short sequence reads
SUMMARY Sequence comparison of genetic material between known and unknown organisms plays a crucial role in genomics, metagenomics and phylogenetic analysis. The emerging long-read sequencing technologies can now produce reads of tens of kilobases in length that promise a more accurate assessment of their origin. To facilitate the classification of long and short DNA sequences, we have develope...
متن کامل